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Cluster Masses from CMB and Galaxy Weak Lensing 
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Gravitational lensing can be used to directly constrain the projected density profile of galaxy clus¬ 
ters. We discuss possible future constraints using lensing of the CMB temperature and polarization, 
and compare to results from using galaxy weak lensing. We model the moving lens and kinetic SZ 
signals that confuse the temperature CMB lensing when cluster velocities and angular momenta are 
unknown, and show how they degrade parameter constraints. The CMB polarization cluster lensing 
signal is ~ 1/rK for massive clusters and challenging to detect; however it should be significantly 
cleaner than the temperature signal and may provide the most robust constraints at low noise levels. 
Galaxy lensing is likely to be much better for constraining cluster masses at low redshift, but for 
clusters at redshift z > 1 future CMB lensing observations may be able to do better. 


I. INTRODUCTION 

The distribution of clusters of galaxies as a function of mass and redshift depends on the cosmological model, 
and can be modelled increasingly accurately 0,1110 Observations of clusters can therefore be used to learn 
about cosmology, as well as to test models for cluster formation and evolution. Observations of the thermal Sunyaev- 
Zel’dovich (SZ) effect 0110 are a powerful probe of the cluster gas, but do not measure the mass directly. To relate 
the gas properties to the total mass involves modelling potentially complicated baryonic gas physics. By contrast 
gravitational lensing probes the projected total mass, not just the gas, and therefore can provide direct information 
about cluster masses. In this paper we analyse the potential for reconstruction of parameterized cluster profiles 
from future observations of cluster lensing of the CMB and weak lensing of distant galaxies. For the first time we 
include constraints from CMB polarization, and also include a model of the moving lens effect that confuses the CMB 
temperature signal when the clusters have unknown velocities. The CMB polarization signal is much cleaner than 
the temperature at low noise levels, and may prove to be a good way to constrain cluster masses at high redshift. 
We perform an essentially optimal statistical analysis in the approximation that the unlensed fields can be treated as 
Gaussian. 

Current CMB observations are not of high enough resolution or sensitivity to measure the cluster lensing signal. 
However, future missions aimed at detecting small levels of primordial gravitational waves via their distinct R-mode 
polarization signal will require both high sensitivity and resolution. This is because lensing by large scale structure 
can convert scalar .E-modes into H-modes, and hence this lensing signal has to be subtracted to extract a small 
primordial E-mode signal from gravitational waves. The lensing reconstruction requires high resolution observations 
in order to have enough information to solve for both the primordial E-modes and the unknown large scale structure 
distribution ;6]. It is therefore of interest to see what other useful information can be gained from such future high 
resolution observations. The resolution required for E-mode cleaning is probably rather less than needed for good 
cluster mass constraints from CMB lensing, however it is clearly of interest to see what can be gained from observations 
with slightly higher arcminute-level resolution. Cluster lensing of CMB temperature and polarization that we consider 
here is one potentially useful possibility. 

Cluster lensing also generates a shear field that is observable by looking at the shapes of galaxies lying behind 
the cluster. This method of cluster mass constraint is promising and possible with current observations, though at 
some level has to be limited by the finite number of source galaxies available behind the lens. The seminal paper 
of Kaiser & Squires [fj describes how to do a non-parametric mass reconstruction; this technique and its variants 
have been applied to numerous clusters (e.g. Refs. (|, Q). Here we focus on parameterized cluster models, which 
enable statistical comparisons to be made between clusters and as a function of observational strategy. The likelihood 
techniques are based on those developed in Refs. mm- We shall investigate how galaxy lensing compares to CMB 
lensing as a function of noise level, cluster redshift and galaxy number count. We use natural units where the speed 
of light is unity. 
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FIG. 1: Simulated effect of cluster lensing on the CMB temperature. Left: the unlensed CMB; middle: the lensed CMB; right: 
the difference due to the cluster lensing. The cluster is at z = 1, and has a spherically symmetric NFW profile with mass of 
M 200 = 10 15 hr 1 Mq and concentration parameter c = 5. Distances are in arcminutes, and can be compared to r 2 oo = 3.3 arcmin. 
This is a rather clean realization; in general the dipole pattern can be weaker and/or more complicated. Note the inverted 
direction of the gradient within the arcminute-scale Einstein radius in the middle figure. 



II. CMB LENSING 


A. CMB temperature lensing 


The unlensed CMB is very smooth on small scales due to diffusion damping, so the small scale unlensed CMB can 
be locally approximated as a gradient. Clusters act as converging lenses, making CMB photons appear to originate 
further from the centre of the cluster than they actually do. So the side of the cluster on the cold side of the gradient 
will look hotter after lensing, and that on the hot side will look colder, giving a distinctive dipole-like signature aligned 
with the direction of the background CMB gradient. The CMB lensing signature therefore consists of small scale 
wiggles in the observed temperature (and polarization) in an otherwise smooth background in 0111 Elia, a 
particular example is shown in Fig. ^ Note that within the Einstein radius the lensing is not strictly weak (though 
deflection angles remain small), however we shall include the signal everywhere as the strong lensing signal on the 
CMB is no more difficult to model than the weak signal in the single thin lens approximation that we use. 

For a Gaussian unlensed temperature field 0, the temperature gradient variance is given by 

(|V0| 2 ) = ^/(Z + 1)^Q 6 , (1) 

1 


where Cf is the unlensed CMB temperature power spectrum. For typical ACDM models that we consider here the 
rms is ~ 14/rK/arcmin. Since massive clusters can give deflections of the order of an arcminute, the signal is expected 
to be at the ~ 10/iK level. The scale of the dipole-like pattern induced by the cluster lensing is much smaller than the 
scale of fluctuations in the unlensed CMB, and so should in principle easily be observable with high enough resolution 
and low enough noise. However the signal depends on the background gradient, so only clusters in front of a significant 
gradient can have their mass constrained this way. The gradient at a point is a Gaussian random variable, so how 
often this happens will depend on how sensitive the observations are to small signals and the level of complicating 
signals acting as sources of correlated noise. 

The observed direction of a point on the CMB last scattering surface is related to the direction it would have had 
without lensing by a deflection angle a, determined in the small-angle Born approximation by 

r Xs 

a(n) = -2 / dx 
Jo 

where h is the direction of observation, \S is the comoving distance to the source at the last scattering surface (taken 
to be thin), rj is the time at which the photon was at position yn, and T is the Newtonian potential. For cluster 
lensing the integral is dominated by the small part through the thin cluster and the angular factor 1 — Xl/xs may 
be taken out of the integral, where xl is the distance to the cluster. The potential is related to the comoving density 
perturbation via the Poisson equation. For a more detailed review of CMB lensing see Ref. E 

If the background gradient could be measured cleanly away from the cluster, the cluster deflection angles could be 
reconstructed directly, and hence used to solve for the cluster profile and mass given certain assumptions. Unfortu¬ 
nately the situation is more complicated because the unlensed CMB isn’t exactly a gradient — clusters have finite 
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angular size so the unlensed CMB in reality has more complicated spatial structure. There is also additional small 
scale power due to other lensing sources along the line of sight, not to mention other important non-linear effects. 
Fortunately the problem is statistically straightforward if we take the unlensed CMB field to be Gaussian. For a given 
deflection field the lensed CMB is also Gaussian since the deflections just re-map points: a uniform sampling of the 
lensed CMB just corresponds a non-uniform (and possibly multiple) sampling of the unlensed CMB. The correlation 
function of the observed temperature is therefore just given by the correlation function at the undeflected position on 
the last scattering surface (neglecting complications due to other lensing along the line of sight). We can therefore 
work out the likelihood of any given cluster deflection field a(9) for some set of cluster parameters 9 using 

-21ogP(a(0)|0) = 0(x l )C" 1 (x i ,x J )0(x J ) +log|C'(xj,x j )|, (3) 

where i,j index the different observed positions (taken here to be pixel centres) which are summed over implicitly in 
the first term. Here C is given by the pixel noise covariance plus the covariance matrix determined by the correlation 
function of the unlensed temperature field 0: 

9/ I i 

Ce(x,x') = (0(x)0(x')) = C e ((3) = Y — 1 Cf P ; (cos/3), (4) 

i 

where (3 is the angular separation between x and x'. Here we only consider isotropic white (uncorrelated) noise a^(x) 
so that 

C(xi,Xj) = Ce(*i + ati,Xj + aj) + 5 ij al r (x i ). (5) 

Non-zero noise regularizes the inverse, for example the case when two observations on the Einstein ring are actually 
sampling the same point on the last scattering surface. The effect of lensing by other perturbations along the line of 
sight can be crudely modelled by using the lensed power spectrum, though a more careful analysis would require a study 
of the non-Gaussian distribution numerically using simulations. We shall only consider small pixelized observations, in 
which case the dimension of the observed data vector is quite manageable, and the covariance matrix can be inverted 
exactly for each set of cluster model parameters considered. For discussion of beam issues see Section urn 

Unfortunately the lensing cannot be observed directly, only its mixture with numerous other non-linear signals 
including the Sunyaev-Zel’dovich (SZ) mss and Rees-Sciama 0 effects. Thermal SZ is in principle not a problem 
because of its distinctive spectral signature, and indeed could be used to help model the kinetic signal due to their 
correlated sources. Kinetic SZ is probably the most problematic [2], and has the same spectrum as the lensing signal 
we are interested in. For a cluster that has circular symmetry about the line of sight, the kinetic SZ signal should also 
have circular symmetry, and therefore be orthogonal to the dipole-like lensing signal. However there will in general 
be non-symmetric spatially varying kinetic SZ both from cluster internal motion and other gas along the line of sight 
that is more problematic. Cluster motion transverse to the line of sight can also give a kinetic Rees-Sciama signal, 
and secondary signals from outside the cluster can contribute additional sources of noise. 

We can easily include a known non-lensing secondary contribution © m into the posterior distribution, 

—2 log P(0|0 tot , 0 m ) = (0 tot - 0 m ) t C'- 1 (0 tot - 0 m ) + log \C\, (6) 

where © tot is the observed temperature (a sum of the lensing signal and other secondary signals). In general the 
other secondaries will also depend on the parameters 9 and are hard to model. 


Moving lens effect 


If the cluster has a significant velocity transverse to the line of sight there will be a moving lens signal 000112: 
a photon passing the front side of the lens (with respect to the direction of motion) will see a weaker potential on 
its way into the lens than on the way out, and hence receives a net redshift. Similarly a photon on the other side 
will receive a net blue shift. The moving lens therefore induces a dipole-like temperature anisotropy, which is at the 
/rK level for massive clusters 0 |. This is essentially the kinetic component of the Rees-Sciama effect (the nonlinear 
growth component gives a small non-dipole-like effect that we neglect). The moving lens signal can also be viewed 
as dipole lensing: in the rest frame of the cluster, the cluster sees a CMB dipole; lensing deflects photons so those 
appearing to come from the front hot side actually come from closer to the cold side, and hence appear colder, and 
conversely on the other side. 

The moving lens dipole-like signal is easy to model 00. The temperature anisotropy induced by a small lens 
moving perpendicular to the line of sight with velocity vj_ (natural units) is at lowest order 
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where 8(3 is the deflection angle of the photon at the lens (related to the observed deflection angle by a = (1 — 
Xl/x*)^P)- The moving lens signal therefore looks very similar to the dipole-like lensing signal, though unlike 
the lensing signal the direction of the dipole pattern is determined by the velocity rather than the (uncorrelated) 
background CMB gradient. The moving lens signal is a source of confusion for the static lensing signal if there is no 
way to measure the transverse cluster velocity independently (e.g. using CMB polarization). 

If we model the cluster peculiar velocities as a 3-D Gaussian random field with rms velocity u rm s, the transverse 
velocity is also Gaussian with (vj_) = (v% + Vy) = 2v^ ms /3. Writing © m = A© = <98(3 ■ vj_ = v x & m ^ x + v y ® m:V we 
then have 

C m = <© tOt 0 tot t) = <©0+ + 0 m ©U =C + ^ + & m ,y&L, y ) ■ (8) 

Here we have assumed any correlation between the CMB and velocity is negligible so the fields are uncorrelated. 
Marginalized over the transverse cluster velocity the likelihood is therefore 

—21ogP(0|© tot ) = ©‘^C- 1 © 40 * +log|C m |. (9) 

Note that in general v rms is a function of the cluster parameters and redshift. The velocities of nearby clusters will 
also be correlated to some extent, in which case treating the velocity uncertainty as independent Gaussians would not 
be quite correct. We may also be interested to extract information about the transverse velocity, in which case the 
velocity posterior distribution could be calculated rather than marginalizing over it as we do here. 


Kinetic SZ from cluster rotation 

In the ideal symmetric case adding a symmetric kinetic SZ template has no effect on parameter constraints because 
the affected modes are orthogonal to those sensitive to the dipole-like lensing signal. This is true even though the 
kinetic SZ is expected to have significantly larger amplitude. The extent to which asymmetric kinetic SZ confuses 
the signal really needs to be tested from simulations I15L llfl . which indicate that in practice it is a major source of 
confusion. 1 Here we consider only a simple analytic model. 

A simple situation that can give a dipole-like kinetic SZ signature is when the cluster is rotating [2~3 . l2fH| . We can 
model this easily even in the idealized case following Ref. |25|]. We assume the cluster is rigidly rotating inside the 
virial radius, and that that the gas is in hydrostatic equilibrium in a non-rotating dark matter potential (2^| . The 
kinetic SZ temperature anisotropy is given by 

dxn e n-v, (10) 

where the integral is along the line of sight in direction n, and the gas velocity is determined from the radius, 
mass and the cluster angular momentum, the latter being parameterized by the dimensionless parameter A following 
Refs. (25|(27j]. The electron density n e is assumed to be associated with the fully ionized gas. We shall make the 
crude approximation that the angular momenta have a 3-D Gaussian distribution, with A rms = 0.04 as in |2^|. This 
rms value is broadly consistent with that found in simulations |27| . Assuming we have no knowledge of the cluster 
rotation, we can then marginalize over the kinetic SZ signal in exactly the same way that we did for the moving 
lens signal. The kinetic SZ signal from rotation gives a dipole-like pattern peaking at a few /iK near the centre, but 
falls off rapidly at large radii (well inside the virial radius). As for the moving lens signal the direction of the dipole 
is expected to be uncorrelated with the background CMB gradient, and we assume it is also uncorrelated with the 
transverse velocity. 

The components of the angular momentum transverse to the line of sight could be measured by other means, for 
example from the redshifts of the galaxies, so in principle it may be possible to model this signal out. However 
it is useful to consider it as an indicator of the level of problems caused by more general kinetic SZ signals from 
substructure and internal motion. 

Since we wish to extract constraints from CMB lensing here rather than by modelling the (in reality complicated) 
kinetic SZ, we shall take the fixed fiducial parameters when calculating the kinetic SZ contribution to the covariance. 
If the kinetic SZ could be modelled reliably as a function of cluster parameters then parameter constraints could be 
improved. 



1 Ref. 0 find a small effect from kinetic SZ, but they only consider the symmetric component. 
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Background secondaries 

On scales l > 4000 the spectrum from inhomogeneous reionization and secondary doppler signals is expected to be 
very roughly scale invariant with 1(1 + l)Ci/2n ~ 5/rK 2 f28| and dominates the (cluster-free) lensed CMB power. The 
extra power provided by this signal actually increases the rms temperature gradient behind the cluster significantly, 
which tends to increase the lensing signal. This is compensated by the increased smaller scale power, acting as a 
source of correlated noise, and degrading parameter constraints. Neglecting non-Gaussianity and the different redshift 
of the source, it can be modelled simply by adding the approximately scale invariant power spectrum to the unlensed 
power spectrum used for computing the unlensed covariance. 

In principle, discrete secondary sources behind the cluster will be sheared by the lens, so shear measurements can 
be used to obtain additional information about the cluster (29j| . This would require substantially higher resolution 
observations, and a detailed investigation is beyond the scope of this paper. 


B. CMB polarization lensing 

The CMB temperature cluster lensing signal is complicated by many sources of confusion, including sizeable kinetic 
SZ and moving lens signals. The lensing signal is also small if the temperature gradient behind a cluster happens 
to be small. Furthermore any attempted density profile reconstruction will have degeneracies because the single 
temperature gradient will not show up orthogonal deflection angles. In principle CMB polarization observations can 
help with all of these problems. 

The statistics of the polarization gradients is discussed in the appendix. The rms gradient of each Stokes’ parameter 
is ~ 1 /iK /arcmin, so a signal ~ 1/iK is expected for massive clusters. On cluster scales the correlation with the 
temperature gradient is fairly small, at the 10% level, so the directions of the gradients are close to independent. The 
addition of polarization data is therefore very much like adding two new temperature fields but with lower signal and 
slightly different CMB ‘noise’ properties. 

The polarization signal is about a factor of ten lower than the temperature signal, so detection will be challenging. 
However the polarization signal should be significantly cleaner as the other secondary polarization signals are generally 
small JsOi El HE HE Sj- The dominant frequency-independent signal is expected to be from scattering of the 
primordial CMB temperature quadrupole from ioniz ed g as in the cluster. This depends linearly on the cluster optical 
depth (~ 1%), and typically gives a signal < O.l^iK [35l HE HE HE■ This signal is strongly correlated across the sky 
for clusters at similar redshifts due to the small variation of the quadrupole within a Hubble volume, and is also fairly 
constant across each cluster (in contrast to the lensing dipole-like signal). It may therefore be possible to subtract it 
out. For spherically symmetric clusters the signal should be orthogonal to the lensing signal and hence irrelevant, so 
we shall neglect it here. There will however be hard to model spatial variations due to varying quadrupole and ionized 
gas density within the cluster that prevent this being done perfectly, but we can expect residuals to be <C 0.1/zK. 

The dominant frequency-dependent signal is likely to be re-scattering of anisotropic thermal SZ, giving a frequency- 
dependent cluster polarization signal < 0.7/iK HEH3 at the peak of the SZ spectrum. Multi-frequency observations 
should be able to subtract this out, and lower frequency observations would in any case see significantly less signal. 
Polarization from scattering of the kinetic quadrupole is expected to be around ten times lower than the signal from 
scattering of the primordial quadrupole H71 Idtif . has a different frequency dependence, and should anyway be 
orthogonal to the lensing signal at lowest order. Contributions from cluster rotation are expected to be very small, 
~ 10 -4 /zK p4|. Other signals such as Faraday rotation also have a distinct spectral signature so in principle they 
can be separated out us ing multi-frequency observations. The small scale signal from inhomogeneous reionization 
is expected to be small |4(j,|41|, and below the signal expected from other lenses. We shall simply assume that all 
non-lensing signals can be removed or are negligible, and roughly model the small effect of other lenses along the line 
of sight by using the lensed CMB polarization power spectra. 

The unlensed polarization fields are expected to be Gaussian, and the full polarized posterior can be computed as 
for the temperature using the correlation functions between the Stokes’ parameters at the undeflected positions. The 
polarization field can be described as a complex spin two field P = Q + iU. The scalar correlation function between 
polarization at x and x' should be independent of the basis used to define P at the two points. To do this, we want 
to describe the polarization in the physically relevant basis defined by r = x - x'. If r makes an angle <p r to the e x 
axis, this amounts to rotating the basis by an angle <p r anticlockwise at each point, giving P r (x) = e _2l< ^ r P(x). In 
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this physical basis we can then define the basis-independent correlation functions (13, (42?) 

07 i 1 

MV) = (Pr(x)*P r (x')) = (P(x)*P(x')> = £ -^(Cf + Cf )4 2 (/3) (11) 

V 

o/ i 1 

e_(/3) = (P r (x)P r (x')> = (e- 4i ^P(x)P(x')) = ^ -^(Cf - C'f)4- 2 (/?) (12) 

l 

07 I 1 

W) = <0(x)P r (x')> = (0(x)e- 2 ^P(x')) = £ ^^4(/3). (13) 

l 

Here Cf and C B are the P-mode (gradient-like) and P-mode (curl-like) power spectra 0, and Cf Y is the cross 
correlation of the P-modes with the temperature. The d! rrm are the reduced Wigner functions (see e.g. Ref. 

Note that a pure polarization gradient is neither unambiguously P or B , as the decomposition is non-local and 
depends on second derivatives. The decomposition into P- and P-modes is not especially helpful for analysing the 
cluster polarization signal: on cluster scales both the unlensed P- and P-mode power spectra are expected to be 
small, with lensing introducing approximately equal small scale power into both (see the appendix). For Gaussian 
fields an optimal analysis can be performed using the Stokes parameters directly without decomposition into P- and 
P-modes. 

Writing the observed pixel temperature, Q and U as a combined vector (0, Q, U), these correlation functions are 
sufficient to calculate the full covariance matrix that we need for computing the likelihood function given by the 
obvious generalization of Eq. ©• Since the temperature signal is likely to be complicated by secondary signals, 
we shall mostly consider the polarization measurements separately from the temperature, though including the full 
correlation structure is no problem if required. Since the polarization signal is so much smaller than the temperature, 
any noise level that allows polarization detection will generally give a much better constraint from the temperature 
if the temperature signal is clean enough. However in practice confusion with other secondaries may make the 
polarization more useful at low noise levels. 


C. Beam and pixelization 

The effect of beam convolution is complicated because the beam convolves the lensed rather than unlensed sky. An 
effectively circular Gaussian beam will in general measure a non-circular non-Gaussian average of the unlensed sky 
due to the non-uniform lensing field. If the unlensed sky and deflection angle over a pixel can be well approximated 
by a gradient, a pixel (or pixel-size beam) average will be very close to the value at the pixel centre, which is what we 
use here. However inside the Einstein radius the deflection angle gradient can change significantly on the pixel scale, 
and a more careful analysis would be required for very accurate results. 

If the centre of a circular pixel is aligned with the centre of a spherically symmetric cluster, an integral over the 
pixel should give the same as the value at the centre of the pixel, which should be the same as the unlensed value. 
In this case the pixel gives essentially no information about the cluster. However if the grid of pixels is offset from 
the centre of the cluster, two adjacent central pixels will have different signals and slightly more information can be 
gained about the central cluster profile. This manifests itself as a somewhat better upper limit on the concentration 
parameter c when pixels centres are offset. Since a generic observation will not be aligned exactly with the cluster 
we have chosen to offset our pixels so as not to introduce artificial symmetries. However there is some error from 
using the values at the centre of the pixels. We find that for the basic case the results are fairly insensitive to the 
pixel integration (checked explicitly by calculating the covariance for pixel values taken to be an average of 4 or 
9 sub-pixels). The behaviour is more complicated when there is significant small scale unlensed power, eg. from 
inhomogeneous reionization. We do not attempt to model the beam effect in detail here. When small scale unlensed 
power is included our results should be taken as an approximate estimate; a more detailed analysis would be needed 
for any given actual experimental beam. For distant clusters with masses ~ 2 x 10 14 Mq the Einstein radius is around 
0.2arcmin, which for aligned pixels would lie entirely within a central pixel of side 0.5arcmin. In this case the result 
from using aligned pixels (where the central pixel contributes no information) should give an idea of the result when 
the strong lensing region is ignored, which corresponds to somewhat increasing the upper limit on c compared to the 
results we present using all the (offset) pixels. 

The effect of pixel (beam) size and cluster redshift is explored in detail in Ref. Q. Here we shall fix the pixel 
size to 0.5arcmin, and use a square box of side 8arcmin (16 pixels) nearly centred over the cluster. Due to the small 
scale power in the CMB, using larger box sizes gains very little since the cluster signature cannot be distinguished 
from variations in the primordial CMB, and our results are stable to increasing the box size. We use the flat sky 
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approximation, which should be very accurate on cluster scales. The sub-arcminute resolution we are considering is 
somewhat beyond the capabilities of currently planned CMB experiments, but could be achievable in future. 


III. GALAXY LENSING 

Galaxies lying behind a cluster are assumed to have uncorrelated shapes. Since we do not wish to assume anything 
about the spatial distribution of the unlensed galaxies, observing the lensed galaxies tells us nothing directly about the 
deflection angle. However lensing does shear the galaxies such that the observed shapes after lensing are correlated. 
Specifically, any function of the shape that transforms under shear like an ellipticity will give an unbiased estimator 
of the shear due to lensing. Observations of the lensed galaxy shapes can therefore be used to constrain the cluster 
profile via the observed shear. Lensing also modifies the number counts of galaxies behind a cluster, and this effect can 
also be used to probe the cluster profile. However, in practice, shear is predominantly used rather than magnification, 
since it requires no external calibration (see Ref. [H3)- 

We shall not be concerned with the details of galaxy shape measurement here, and only consider the ideal case in 
which the point spread function can be accounted for exactly and there are no other observational systematics. We 
assume some shape measurement e = e + + ie x that transforms like an ellipticity, gives an unbiased estimator of the 
reduced shear, (e) = g, and that it can be measured exactly. The quantity e+ is an ellipticity in the direction of some 
chosen basis axes, and e x is the corresponding ellipticity in a basis rotated by 45°; we work in a polar basis, centred 
on the cluster. We neglect any small correlations in the unlensed galaxy shapes, and take the galaxies to be essentially 
point-like with known position x. The dispersion of the intrinsic shapes of the galaxies provides a source of ‘noise’ on 
the shear, and means that the shear cannot be measured perfectly because there are only a finite number of lensed 
galaxies. 

Our measure for (complex) ellipticity |e| exp(2ic(>) is such that its modulus corresponds to (1 — r)/(l + r) where r 
is the galaxy’s axis ratio, and <fi is its position angle. The distribution of galaxy ellipticities is non-Gaussian; however 
for our purposes we shall approximate the distribution as a Gaussian with some variance cr 2 , which is related to the 
unlensed variance cr 2 through 0 


~ (i - Isl 2 ) o-u, (14) 

which is a reasonable approximation for weak shears. The likelihood function for one galaxy is then given to within 
a constant by 


—2 logP(0|ej, Zi,Xi) = 2|e< - g{9, Xj, Zi)\ 2 /a 2 + 21ogcr 2 . (15) 

Here Zi is the redshift of each source galaxy (we assume the cluster redshift is well known). In the case when the 
source redshifts are not known exactly, the marginalized probability is given by 

P(0|e f ,x*)= f dziP(9\ei,Zi,Xi)P(zi) (16) 

where P(zi) is the probability distribution for Zi- For z % less than the cluster redshift the shear is zero, so P(9\ci, Zi , x,) 
is independent of 9. Since we assume the intrinsic ellipticities are uncorrelated and neglect systematics, the probability 
from observations of N galaxies is just the product of the probability from each. 

Since these results are only valid in the weak lensing regime, we exclude the central region of the cluster around 
the Einstein ring; the same physical region is excised for a cluster of a particular mass. This is of the order of an 
arcminute for massive clusters. 


IV. EXPECTED RESULTS 
A. Mean log likelihoods 

We follow Ref. |n| by calculating the expected log likelihood for a set of cluster parameters 9. For some fiducial 
model with parameters 9q this is given by (log P(0|data)) where the average is over data realizations that could come 
from the 9q model, given the noise properties. For observations of a large number of identical clusters, this mean log 
likelihood gives the average contribution of one cluster to the total joint log likelihood. For the CMB we have 

-2(logP(0|0 o )> =Tr [C{9 0 )C-\9)\ + log|C(0)| 


(17) 
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where C(ff) is the correlation matrix for the undeflected points. The mean log likelihood peaks at the true model 
9 = 9 0 , and the shape of the likelihood contours give a good idea of the degeneracy directions expected from particular 
observations. For the temperature case C is replaced with C m when we want to marginalize over the moving lens or 
rotating SZ signals. 

The mean log likelihood for one galaxy at redshift 2 is 0 


- 2(log P(9\9 0 ,x,z)) 


-2 J d 2 eP(e| 0 o )logP( 0 |e, 2 ,x) 

I g{9,x,z) - g(9 0 ,x,z)\ 2 + al 


+ log a‘ 


(18) 


where ao denotes er (g (9 q)). Given the probability P(N ) to find N galaxies in the data field (which follows a Poisson 
distribution and is independent of position) the number density observed n(9o, x, z) depends on the magnification due 
to the lensing. We follow Ref. [k)J by taking the scaling of number density with magnification p to be [p(9o, x, 
where (3 is the slope of the unlensed number counts, taken to be 0.5. The number density function averaged over the 
total number density distribution is then 

{n(9o, x, z)) N = J dN[fi(0o, x, z)] l3 ~ 1 NP(N)P(z) = n 7 [^(0 o , x, z)] f3 ~ 1 P(z), (19) 


where n 7 is the expected unlensed angular number density (including all redshifts) for which a shape can be measured. 
The source galaxies have a redshift probability distribution P(z) normalized so / 0 °° dzP(z) = 1. The total mean log 
likelihood is then 


-2(logP(0|0„)> 


-2 J dz J d 2 x(n(0 o ,x, z)) N {logP(9\9 0 ,x, z)) e 

2 n 7 /d 2 P( 2 ) j d^(9o ,x, *)]'-! ( I g(g,x,,)- g (g 0 ,x,,)|^ + ag +lo8 ^ 


( 20 ) 


as derived in Ref. 0 - Monte Carlo simulations were also performed in Ref. 0 which showed that the scatter 
of maximum likelihood points in different realizations corresponds well with the mean log likelihood contours. For 
galaxies with redshift less than the cluster the shear is zero, and there is no contribution to the integral (except an 
irrelevant constant). This result is valid when the observations measure the redshift of each galaxy accurately. In the 
case when the redshifts are uncertain the result is more complicated 


-2<logP(0|0 o )> 


-2 J dz J d 2 x (n(0 o , x, z)) N (log P(6\6 0 , 
-2n 7 j dzP(z) J d 2 x [p,(9 0 , x, z]f~ l J 


x ))e 

d 2 e P(e\9o, z,x) log 


J dz'P(9\e, z\ x)P(z'\z) 


,( 21 ) 


which cannot easily be simplified any further. Here P(z'\z) is the post-observation distribution of the redshift z' given 
the galaxy is actually at 2 . Note that Eq. (20) reduces to Eq. (19) when the source redshifts are known. 

Throughout we shall assume a purely adiabatic standard ACDM cosmology with Hubble parameter Hq = 
70kms _1 Mpc _ , dark matter density Q c h 2 = 0.11, baryon density fl^h 2 = 0.022, spectral index n s = 0.99, pri¬ 
mordial curvature perturbation amplitude A s = 2.5 x 10 -9 (corresponding to matter fluctuation parameter today 
as ~ 0.87) and optical depth r = 0.15. 


B. Spherically symmetric NFW clusters 

Our main purpose here is to compare the power of different sources for cluster mass lensing reconstruction. We 
therefore use a simple parameterization for the cluster radial profile, and see how well the parameters can be con¬ 
strained using different methods. We shall follow Ref. 0 and assume clusters with a spherically symmetric NFW 
profile l4fil given by 

A 

r(cr + r 2 00) 2 


p(r) = 


( 22 ) 
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for some concentration parameter c, and radius r 2 oo ■ We shall parameterize the mass of the cluster by that inside the 
radius r 20 o defined so that it is 200 times the mean density of a critical density universe at the cluster redshift, 

M 200 = 200—p c (z)r 20 oj (23) 

where p c {z) is the critical density at the redshift of the cluster, 3H 2 (z)/8nG. The amplitude is given in terms of the 
mass and concentration parameter by 


A = 


M 200 c 2 


47 r[ln(l + c) — c/(l + c)] 


The deflection at the cluster is then given by 0 


- / ■. — IQttGA , , 

53{r) =--- F(r r s ) r, 


c r s 


(24) 


(25) 


where r is the transverse distance from the centre of the cluster, f = r/r, r s = r 2 oo/c is the scale radius, and 


F(x < 1) 
F(x = 1) 
F{x > 1) 


\ ^ln(a;/ 2 ) 

1 - ln( 2 ) 
i ^ln(a;/ 2 ) 


ln(x/[l — ypF- 
\J\ — x 2 



7r/2 — sin 1 (l/a;)^ 

y/x 2 — 1 / 


(26) 


The observed deflection is then given by 59 = (1 — Xl/xs)^P where xl and xs are the comoving distances to the lens 
and the source (for the CMB taken to be the point of maximum visibility on the last scattering surface). 

The convergence k is (minus one half) the angular divergence of the observed deflection angle, and at the radial 
distance r from the centre of the cluster is given by (see e.g. Ref. 4dj]) 


K.(r) = K k f{r/r s ), 


where f(x) = x 1 ^[iF(a;)] is 


Here 


f{x < 1 ) 
fix = 1 ) 
fix > 1 ) 



2 tanh 1 ^/(l — x)/(l + x) 
a /1 — x 2 


2 tan 1 y/(x — l)/{x + 1 ) 
y/x 2 — 1 


Kk = 


2 x s p s 
^crit 


87 tGA 

2 2 
c 2 rj 


DlO- - Xl/xs), 


(27) 


(28) 


(29) 


where E cr jt = [47 tG.Dl(1 — Xl/xs )] _1 is the (redshift dependent) critical surface mass density of the lens and the 
angular diameter distance to the lens is Dl. The characteristic density of the halo p s is related to the amplitude A 
through 



In a polar basis centred on the cluster the shear 7 is real and given by 


(30) 


7 ( r ) = K k j(r/r s ) , 


( 31 ) 
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FIG. 2: Mean log likelihood constraints for a M 200 = 10 15 /i _1 Mq cluster with concentration parameter c = 5 using CMB 
temperature lensing only. Dark solid is CMB lensing for 1/rK noise per 0.5 arcmin pixel assuming only lensing signal, paler 
solid is when a moving lens contribution is added and constraints are marginalized assuming a Gaussian velocity distribution 
with Vrms = 300kms _1 . Dashed lines show the constraints marginalized over an unknown rotating kinetic SZ contribution. 
Contours show where the exponential of the mean log likelihood drops to 0.32 and 0.05 of the maximum. 



where 


4 tanh 1 y / (l — x)/{\ + x) 21n(|) 1 2 tanh 1 ^/(1 — x)/{l + x) 

x 2 ^/l — x 2 x 2 x 2 — 1 ( x 2 — 1) Vl — x 2 

j( x = 1) = 21 »(}) + 3 

v 4 tan -1 yj(x —l)/(x + \) | 21n(|) 1 > 2 tan -1 -^/(x — l)/{x + 1) 

j[x > lj = - I o 2 7 TTi 773 

x 2 Vx 2 — 1 m tc — 1 (a: 2 —l) 2 

The reduced shear g follows from g = 7/(1 — k) and the magnification /z = 1/ ((1 - k) 2 - I 7 I 2 ). 


(32) 


C. CMB lensing 

The parameters we shall attempt to constrain are the concentration c and the mass parameterized by M 2 oo- For 
simplicity we shall assume the position of the centre is known (for example from observations of the thermal SZ), so 
there are only two parameters. 

Since our model is spherically symmetric the kinetic SZ from line of sight motion is also symmetric about the 
cluster centre, and therefore orthogonal to the lensing signal. It can therefore be neglected. We show the effect of 
marginalizing over the various asymmetric signals on massive cluster constraints in Fig. [ 5 ] As a rough model of the 
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FIG. 3: Mean log likelihood constraints for a M 200 = 2 x 10 14 /i _1 AFq cluster with concentration parameter c = 5 using CMB 
lensing only with noise 0.1/xK per 0.5arcmin pixel. Filled contours are for temperature lensing, from the inside out (dark to 
light) they are: 1. CMB lensing only; 2. Marginalized over moving lens signal; 3. Marginalized over moving lens signal and 
including small scale power from inhomogeneous reionization. The dashed line shows the result when the kinetic SZ signal 
from cluster rotation is unknown and also marginalized. The solid unfilled contour shows the result from a clean polarization 
observation with noise v2 x 0.1/xK on each Stokes’ parameter. Contours enclose the area where the exponential of the mean 
log likelihood is greater than 0.05 of the peak value. 


moving lens signal we take a constant u rms = 300kms _1 (see e.g. Ref. H^j). Although only a fairly small effect at 
low redshift where the constraints are weak anyway, at high redshift the moving lens signal significantly increases the 
uncertainty in the cluster mass. Kinetic SZ from cluster rotation is a big problem if this cannot be modelled, and 
reflects the fact that temperature mass measurements are probably kinetic SZ limited mm- At the 0.5 [iK aremin 
noise level the polarization does not give a useful constraint due to the polarization signal being about ten times 
smaller than the temperature. 

At high redshift there are expected to be almost no very massive clusters, so in Fig. 0 we show the effect for a 
more realistic cluster with M 200 = 2 x 10 14 /i -1 M© with ten times lower noise, 0.05/xK aremin (y/2 x 0.05 /xK aremin 
on the Stokes’ parameters). In the absence of confusing signals the temperature CMB gives excellent constraints at 
high redshift. However these are massively degraded by marginalization over other non-linear effects. The effect of 
more small scale power from inhomogeneous reionization significantly degrades the constraints at low redshift. We 
also show the constraint expected from a clean polarization observation, which at this noise level is competitive with 
what can realistically be achieved from the temperature. 
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FIG. 4: Mean log likelihood constraints for a M 200 = 10 15 /i -1 Mg cluster with concentration parameter c = 5 from galaxy 
lensing with 30 galaxies arcmin -2 and an intrinsic ellipticity dispersion <j u = 0.3. Solid contours show the constraint for (z) = 1 
when galaxy redshifts are known, dashed is the equivalent result when the redshift distribution is known. Contours show where 
the exponential of the mean log likelihood drops to 0.32 and 0.05 of the maximum. 
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FIG. 5: Mean log likelihood constraints for a M 200 = 10 15 /i -1 Mq cluster with concentration parameter c = 5. Dark solid is 
CMB lensing for 1/rK noise per 0.5arcmin pixel marginalized over the moving lens signal, red solid is for space-based galaxy 
lensing with 100 galaxies arcmin -2 , dashed lines are for current ground-based galaxy lensing with 30 galaxies arcmin -2 . All 
galaxies have known redshifts, a u = 0.3, and their distribution has (z) = 1. Contours show where the exponential of the mean 
log likelihood drops to 0.32 and 0.05 of the maximum. 
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FIG. 6 : Futuristic mean log likelihood constraints for a A /200 = 2 x 10 14 /i _1 Mq cluster with concentration parameter c = 5. 
Dark solid is for clean CMB polarization lensing for y/2 x 0.1/rK noise per 0.5arcmin pixel on the Stokes’ parameters, paler 
solid is for galaxy lensing with perfect redshift information and a total of 500 galaxies arcmin -2 with a Gaussian ellipticity 
distribution. All galaxies have a u = 0.2 and the distribution has (z) = 1.5. Contours show where the exponential of the mean 
log likelihood drops to 0.32 and 0.05 of the maximum. 


D. Galaxy lensing 

We assume that the sources are randomly distributed on the sky, and have a redshift probability distribution P(z)dz 
of the form suggested by Ref. f49lj 


P{z)dz 


V z 2 exp [- [z/zoY] 


(!) 


dz. 


(33) 


Choosing P(z)dz = (27/2)z 2 exp [— 3^] dz gives ( z) = 1, which is typical of current deep surveys. For comparison, we 
also consider a survey with ( z) = 1.5, the form of which is P(z)dz = 9.57z 2 exp [—2.91z 0 ' 78 ] d 2 . The distributions are 
skewed and peak at z = 2/3 and 2 « 0.85 respectively. 

For clusters with 2 < 0.3, and for typical (current) weak lensing observations with (z) = 1, the cluster’s lensing 
properties are quite insensitive to the actual redshift distribution of galaxies. The faint galaxy population can be 
safely approximated as lying on a sheet at (z), or more precisely at the redshift corresponding to (1/E cr it(~)), with 
Scrit(z) being the critical surface mass density. We do not however make this approximation here. The availability 
of photometric redshifts for galaxies is becoming increasingly likely with the advent of wide-held infrared imagers to 
complement observations from optical instruments, and parameter estimates from such data sets would be comparable 
to those with spectroscopic redshifts When there is no direct information about redshift, the galaxy’s magnitude 
can be used to determine whether it is likely to be behind the cluster, so P{z'\z) ~ P(z') for z’ larger than about the 
cluster redshift, and small or zero otherwise. 

For the M 200 = 2 x 10 14 /i _1 Af 0 cluster, the minimum radius of the aperture from which data is assumed to be 
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available is 0.145 Mpc, and for the M 200 = 1O 15 /i _1 M 0 cluster it is 0.35 Mpc. For the outer radius, we integrate until 
the shear signal is negligible, and convergence in constraints is obtained. For the most massive cluster at z = 0.25, 
this is equivalent to an angular scale of ~ 15 arcmin (or to that of a wide-held imager such as that on the ESO-MPG 
2.2m telescope). 

Fig.m shows expected constraints on massive cluster parameters from galaxy lensing with present day parameters. 
The constraints on clusters with z > 1 are very weak with current data, so we show only the useful constraints for 
lower redshift clusters. The figure shows the degrading effect of not knowing the source galaxy redshifts which have 
been marginalized over using Eq. for the z = 0.5 cluster this is a noticeable effect, however for z = 0.25 most 
of the galaxies lie well behind the cluster so knowing the individual redshifts gains very little. At higher redshifts 
the effect would be much more important, however future surveys able to give a useful constraint for higher redshift 
clusters would almost always have redshift information anyway. 

In Fig. 0 we compare current galaxy lensing constraints to those from possible future CMB temperature lensing 
observations. Even current galaxy lensing data can do better than futuristic CMB temperature lensing for low redshift 
clusters. However for z > 0.5 CMB lensing would be able to improve on current constraints if asymmetric kinetic SZ 
contamination and background secondaries did not destroy the signal. The fact that the CMB lensing constraints are 
weaker at low redshift when the deflection angle is larger may seem odd. However the point is that the deflection itself 
is unobservable, only the dipole-like variation can be cleanly distinguished from variations in the unlensed CMB. The 
constraints are therefore better for higher redshift clusters where the cluster has a smaller angular size, and hence the 
dipole-like pattern is more easily distinguished from larger scale variations in the unlensed CMB. 

If Fig. © we compare futuristic constraints from clean CMB polarization lensing with constraints from futuristic 
galaxy lensing with (z) = 1.5 (CMB polarization and temperature are compared in Fig.0). For clusters with redshifts 
well below the galaxy distribution peak, galaxy lensing should be much more powerful than CMB lensing. At redshifts 
higher than the peak of the galaxy distribution function, the galaxy lensing results however become much worse due 
to the lack of sources, and CMB polarization lensing may be a better way to measure the mass of clusters at redshift 
z > 1. At these redshifts the CMB lensing result is also less degenerate with the concentration parameter than at low 
redshift. 

Another source of noise in the determination of cluster mass profiles from galaxy lensing that we have neglected is 
the large scale structure along the line of sight. This has been estimated to cause as much as a factor of 2 increase in 
uncertainties in estimates of c and M 200 [Ml- Structure correlated with a cluster leads to a bias, with the masses of 
clusters in excess of 10 14 /i _ 1 M 0 being overestimated by about 20% 0. Techniques are being developed to alleviate 
these issues (e.g. see Ref. j^). For a particular cluster, if photometric or spectroscopic redshift information is available 
for the data field, then to some extent the importance of the large scale structure can be assessed by identifying any 
structures (e.g. galaxy groups) along the line of sight, or correlated large scale structure such as filaments. 


E. CMB lensing with non-symmetric profiles 

So far we have presented results for mean log likelihoods that give an idea of average results per cluster when you 
observe many. However one must remember that CMB temperature lensing constraint is highly variable due to the 
variability of the gradient behind any given cluster. If a cluster happens to be in the middle of a broad hot spot there 
will be essentially no constraint. 

The assumption of spherical cluster symmetry is also qualitatively important for the CMB temperature lensing 
results. If a cluster is observed in front of a pure gradient field, then deflection in the direction orthogonal to the 
gradient will be unconstrained. If we assume spherical symmetry this does not really matter as the profile can be 
constrained from the gradient-direction deflections. However if the cluster is non-symmetric, there is a fundamental 
degeneracy: CMB lensing of a pure gradient field cannot constrain orthogonal deflections at all. Of course the CMB 
is not a pure gradient, and in this case small scale CMB power actually helps to break this degeneracy to some extent. 
Furthermore CMB polarization provides two additional gradient directions, so the probability of all three gradients 
being aligned or very small is low. Polarization observations should be able to significantly help general cluster profile 
reconstruction. This is important because in reality the majority of clusters are expected to be prolate [§j rather than 
spherical, as well as having interesting substructure. 

General cluster profile reconstruction from CMB lensing is beyond the scope of this paper (see Ref. [23| for some 
work with CMB temperature). Here we consider a simple toy model to illustrate the issues: we try to reconstruct a 
stretched NFW deflection field 


a = £n • ccnfw A + ft± ' unfw ft± (34) 

where £ is a stretch parameter. We choose ft to be in the direction of the CMB temperature gradient or the direction 
orthogonal to it, where ftj_ is the corresponding orthogonal direction. For simplicity we fix the concentration parameter 
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FIG. 7: Constraints on the stretch parameter £ from four random realizations, for direction parallel (left) and orthogonal 
(right) to the direction of the background CMB temperature gradient. Solid lines are using CMB temperature only, dashed 
polarization only, filled contours show the combined result (accounting for the correlation). The fiducial cluster was at redshift 
one and has a spherically symmetric NFW profile with mass of M 200 = 2 x 10 14 /? — 1 Mg. The noise is 0.1/rK per 0.5arcmin 
pixel (%/2 x O.l^tK for the polarization) the concentration is fixed to c = 5. 


to c = 5, and only consider the idealized case where the temperature signal is purely CMB lensing. Note that this 
toy model is not a realistic lensing deflection angle field as in general it is not curl free. 

In Fig. 0 we show the constraints on the stretch in the two orthogonal directions for four different realizations of 
the underlying fields. The CMB temperature stretch constraint in the direction orthogonal to the CMB gradient is 
generally very poor. The CMB polarization constraints are slightly correlated, but generally in significantly different 
directions, meaning that joint constraints are significantly better than those from the CMB temperature alone. 

The presence of smaller scale CMB power from e.g. inhomogeneous reionization can actually improve the constraints 
from the temperature in the orthogonal direction due to the extra information available in the smaller scale power. 
However, as emphasised above, the CMB temperature signal is likely to be complicated due to SZ etc, so in practice 
the CMB temperature result is likely to be very much worse than that shown here. CMB polarization may be a much 
better probe. 


V. CONCLUSIONS 

Weak lensing is a valuable and promising method for studying cluster masses. Measurements of lensing shear using 
observations of lensed galaxies provides tight constraints for low redshift clusters. For clusters at higher redshift than 
the peak of the galaxy-redshift distribution the galaxy lensing constraints become much poorer, and CMB lensing can 
do better. However, even with simple spherically symmetric models the temperature lensing signal can be degraded 
by various other second order effects. For futuristic arcminute-resolution observations at low noise levels the CMB 
polarization lensing signal may be much cleaner and a more robust way to measure cluster properties. Measurements 
of lensing by high redshift clusters is therefore something that future CMB polarization missions may wish to aim to 
achieve. 

The results from galaxy lensing are limited by the intrinsic ellipticity dispersion of the galaxies, and the fact that 
there are only a finite number of sources behind the cluster. To do better one could try to find sources which have 
a higher number density. Possibilities include high redshift sources observed with 21cm, sources from the time of 
inhomogeneous reionization, and secondary doppler CMB signals from velocities after reionization mmm. in 
addition strong lensing can be used to help constrain the central region of the cluster profile. The ultimate limit 
from CMB polarization will depend on how efficiently spectral information can be used to clean out confusing signals, 
and the extent to which cluster substructure complicates the signal from quadrupole scattering. Future work could 
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investigate this using numerical simulations. 

With the appropriate increase in resolution and sensitivity, methods for cluster CMB lensing could be extended to 
constrain galaxy profiles (as discussed for the CMB temperature in Ref. [E3]). 
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APPENDIX A: GRADIENT APPROXIMATION 


Since gradients are just linear combinations of the (assumed) Gaussian underlying fields, the gradient fields are also 
Gaussian and hence fully described by their covariance. We can either work out the full distribution of the gradients 
directly, or just work out the covariances. Here we chose to do the latter, and calculate the covariances of the gradients 
at a point assuming statistical isotropy. The temperature gradient variance is given by 


2 Pq = (|V0| 2 ) 


(E 0 *™ VFi ™| 2 > 




1 

47r 


E / dm ‘mV 2 Y; m 


E ^ ^ 


21 + 1 
4n 


s-iO 


(Al) 


where we used statistical isotropy and orthogonality of the spherical harmonics. A polarization tensor P ab can be 
defined so that in a fixed orthonormal basis 


p — - 
- 1 ?.■? — ~ 


1 (Q U 


2 \ U -Q 


(A2) 


and may be expanded in terms of gradient and curl tensor spherical harmonics f5(| Y^ b ( lrn) and Y^+m)- The har¬ 
monic components describe the E- and R-modes of the polarization respectively. The correlation of the polarization 
divergence and temperature is given by 


Px = \(V a PabV b T) = I 

v Im 

1 v / (l + 2)! 21 + 1 x 

4 i V (*-2) ! 4?r 1 ’ 


(A3) 


where is the temperature-polarization cross-correlation power spectrum. The other terms are zero: 


(V°P q6 e b c V c 0) = 0. 


(A4) 


The variance of the polarization divergence is 

p P = (\/ a p ab v c p c b ) = \zv ++cf )■ ( A5 ) 

i 

Similar results can be derived for the variance of the irreducible polarization gradient 3-tensor. In terms of a fixed 
flat-sky basis we have 

(V,UV. y 0) = (V y UV x Q) = (V x gv,0) = -(\/yQ\/yQ) = Px, (A6) 


(|VQ| 2 ) = (|VU| 2 ) = 2Pp, 


(A7) 
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with other terms being zero, as can readily be verified by using an explicit flat-sky harmonic expansion. The stochastic 
quantities are (Vj,©, V y 0, ^7 X Q, V y Q, V X U, V y I7), which therefore have covariance 
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(A8) 


The correlation P\ is negative, and the (anti-)correlation Px /(PqPp) 1 / 2 is about 10%. On the small scales we are 
considering, photons are flowing into hot spots, which for a plane wave means that the correlated part of Q (defined 
with respect to the wavevector—parallel to V0- basis direction) is negative on the the hot crest |57lj. Q therefore 
increases in the direction of the cold crest, and hence that the gradients in Q and 0 are anti-correlated. Note for our 
conventions we have to flip sign of Cf- from CAMB/CMBFAST which use the conventions of Ref. 0- 

In a fixed flat sky basis, using the gradient approximation, some deflection field a(r) gives the lensed polarization 
field 


Pab(j) = Pab(r) + a(r) • VPab- (A9) 

A constant gradient is neither E or R-rnode, because making the E/B decomposition locally requires taking two 
gradients. This is reflected in the fact that the covariance is a function only of the sum of the power spectra C£ + Cf. 
On the flat sky the scalar harmonics are e ll x and tensor harmonics are (we follow the conventions of Ref. |T3] ) 

1) = ~V2 i (Q ih>e ,1 ' x (A10) 

Q C ab { 1) = -V2 e c (a i h) i c e ax , (All) 

where angle brackets denote the symmetric trace free part of the enclosed indices and 1 = 1/|1|. Assuming a suitably 
well behaved deflection field the lensed harmonic components are then 

E{ 1 ) = — 2a(l)- VP ab l< 0 i 6 > 

B( 1) = —2a(l)- VP ab e c H b n c . (A12) 

Averaging over (assumed Gaussian) background polarization gradients VP 0 {, we get 

(|A(1)| 2 ) = Pp|a(l)| 2 

(l-B(l)l 2 ) = Pp|«(1)| 2 . (A13) 

An arbitrary deflection field therefore gives identical power for each E and B mode on average when lensing a pure 
gradient field. If the deflection field has circular symmetry (as from a spherical cluster), the angular average of the E 
and B mode power are equal for any fixed polarization gradient: 

^ J d<h\E(i )\ 2 = ^1(|VQ| 2 + |VC/| 2 ) 

i- J d</> 1 |I?(l)| 2 = ^ (|VQ| 2 + |VC/| 2 ) . (A14) 

Cluster lensing therefore generates equal amplitude E and B in the gradient approximation. However there is very 
little power in the unlensed E or B polarization on cluster scales, so the lensed E contains almost as much information 
as the lensed B. We use the full likelihood function so there is in fact no need to use E and B. However for nearby 
large clusters the B mode signal may allow the cluster signal to be distinguished from unlensed CMB ‘noise’, so unlike 
in the temperature case cluster lensing may not be CMB noise limited for large cluster sizes. 

Since the temperature gradient defines a direction, we could chose to define the Q and U Stokes’ parameters with 
respect to this variable basis, e.g. where we align the x-axis with V0. In this basis 

(|V X 0| 2 )' = (|V0| 2 ) = 2(|V X 0| 2 ), 


(A15) 
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simply twice the variance of the x-component in a fixed basis, and similar results hold for the non-zero terms. 
By choosing this basis we are however making the whole distribution non-Gaussian, for example the marginalized 
distribution of S = (V^©)' = |V0| is P(6) oc 6 exp(— S 2 /2Pt). 
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